Versions:
xan 0.57.0, published by médialab Sciences Po, is a high-performance command-line CSV processor written in Rust that targets data workers who need to manipulate multi-gigabyte delimited files without leaving the shell. Originally forked from BurntSushi’s xsv, the utility has been almost completely rewritten to support the lab’s social-science research pipelines, adding domain-specific functions for lexicometry, network analysis and light scraping while still exposing the classic repertoire of slice, filter, sort, join and aggregate operations. A SIMD-accelerated parser and optional multithreading let typical tasks finish as fast as the local machine allows, and an embedded expression language—purpose-built for tabular data—delivers runtime speeds well above those of embedded Python, Lua or JavaScript interpreters. Because academic sources often arrive in discipline-specific variants of comma-separated text, the tool also recognizes formats common to web archives (.cdx), bioinformatics (.vcf, .gtf, .sam, .bed) and other CSV-adjacent standards, converting them on the fly to a uniform data stream that can be chained through xan’s composable sub-commands. Six numbered releases have appeared to date, with 0.57.0 being the current stable milestone. The program is classified under “Data Processing & CSV Tools” and is aimed at researchers, system administrators and developers who require reproducible, scriptable transformations of large textual data sets. The software is available for free on get.nero.com, with downloads provided via trusted Windows package sources (e.g. winget), always delivering the latest version, and supporting batch installation of multiple applications.
Tags: